Estimating the First Selected PSU Mean in a Two Stage Cluster Sample with Unequal Cluster Sizes based on an Expanded PSU Model

Author

  • Ed Stanek
Abstract

Scott and Smith (1969) develop estimators for linear functions from a two stage cluster sample, with their discussion repeated in many places. This discussion was reviewed in c01ed13.doc. It is noteworthy that Scott and Smith allow expressions for the variance to depend on the cluster. A similar development is given by Valliant et al. We repeated the derivation of Valliant et al. using the finite population variance in c01ed15.doc. These results were extended to settings with response error in c01ed25.doc. Both derivations are based on a ‘collapsed’ version of a vector of random variables, similar to the vector of random variables usually considered. In the absence of response error, we developed results like those in c01ed15.doc in the context of an expanded PSU model with equal cluster sizes in c01ed27.doc. The results in c01ed27.doc assume equal cluster sizes in the population and produce the same estimates as the results in c01ed15.doc. However, new ideas are introduced in c01ed27.doc that we anticipate will make it possible to expand the development to unequal size PSUs. The new ideas involve somewhat ‘arbitrary’ projections of terms in estimating equations to make the resulting equations of full rank and invertible. The projections are selected to make the estimating equations reduce to the estimating equations in c01ed15.doc. The purpose of this document is to extend the results in c01ed27.doc to unequal size PSUs. The documents c01ed15.doc and c01ed27.doc assume that the number of SSUs in a PSU is equal for all PSUs. In an additional document, the expected value and variance are developed for a two stage problem with unequal size PSUs with response error (see c01ed26.doc). However, the document c01ed26.doc stops short of estimation. In order to construct the estimating equations clearly, the imbalance in PSU sizes has to be accounted for. It seems that this accounting will be more straightforward in an expanded population framework.
With this in mind, we develop an estimator for a cluster mean under two stage sampling with unequal size PSUs in an expanded population framework. We consider here the simpler setting where we have no response error. We expand the representation of the PSUs, but not of the SSUs. The basic problem that we consider is estimation of a linear combination of population values based on the observed response for subjects and SSUs selected via two stage cluster sampling. The problem can be summarized with a model for the vector of random variables that represents a random permutation of the population. Detailed development and notation are given elsewhere, in particular in documents c00ed57.doc, c01ed15.doc, and c01ed27.doc.
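
The sampling setting described above can be illustrated with a minimal sketch. It is not the estimator developed in this document: it only draws a two stage sample from a small population with unequal size PSUs and compares the mean of the first selected PSU (the target quantity) with the simple sample mean from that PSU. The population values, the function name `two_stage_sample`, and the rule of capping the second stage sample size at the PSU size are all assumptions made for illustration.

```python
import random


def two_stage_sample(population, n_psu, m_per_psu, seed=None):
    """Draw a two stage sample without replacement.

    population: list of lists; population[i] holds the responses of the
                SSUs in PSU i (PSU sizes may differ).
    Returns a list of (psu_index, sampled_values) in order of selection.
    """
    rng = random.Random(seed)
    # Stage 1: simple random sample of PSUs. The order of the returned
    # indices is the order of selection, so sample[0] is the first selected PSU.
    selected = rng.sample(range(len(population)), n_psu)
    sample = []
    for i in selected:
        # Stage 2: simple random sample of SSUs within PSU i.
        # Assumption: never ask for more SSUs than the PSU contains.
        m = min(m_per_psu, len(population[i]))
        sample.append((i, rng.sample(population[i], m)))
    return sample


def mean(values):
    return sum(values) / len(values)


# Hypothetical population: four PSUs of unequal size.
population = [
    [3.0, 5.0, 4.0],
    [10.0, 12.0],
    [7.0, 6.0, 8.0, 9.0, 5.0],
    [1.0, 2.0, 3.0, 4.0],
]

sample = two_stage_sample(population, n_psu=2, m_per_psu=2, seed=1)
first_psu, first_values = sample[0]

psu_mean = mean(population[first_psu])  # target: mean of the first selected PSU
naive_mean = mean(first_values)         # simple sample mean, shown for comparison only

print("first selected PSU:", first_psu)
print("true PSU mean:", psu_mean)
print("sample mean from that PSU:", naive_mean)
```

Repeating the draw over many seeds gives an empirical view of how any candidate estimator of the first selected PSU mean behaves when PSU sizes differ.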

Related resources

Estimating the First Selected PSU Mean in a Two Stage Cluster Sample- Expanding the PSUs

Scott and Smith (1969) develop estimators for linear functions from a two stage cluster sample, with their discussion repeated in many places. This discussion was reviewed in c01ed13.doc. It is noteworthy that Scott and Smith allow expressions for the variance to depend on the cluster. A similar development is given by Valliant et al. We repeated the derivation of Valliant et al. using the fin...

A Model for a Nested Study

We present notation that can be used to express a simple model for a nested study. The model is based on the assumption that response has been reported for subjects selected via a two stage cluster sample. At the first stage, a simple random sample of clusters (which we will also refer to as primary sampling units (PSU)) is selected. At the second stage, a simple random sample of subjects (whic...

Comparisons of EMSE of Predictors for a PSU Mean with Unequal Size Cluster Sampling

Expressions for the EMSE for the RP model predictor were developed in the setting where the variance of units within clusters is equal for all clusters, and clusters and sample size per cluster are of equal size. We consider this setting here since the development should be simpler. In this context, the sample mean, the mixed model predictor, Scott and Smith’s predictor, and the random permutati...

Predicting Random Effects from a Finite Population of Unequal Size Clusters based on Two-Stage Sampling

Prediction of random effects is an important problem with expanding applications. In the simplest context, the problem corresponds to prediction of the latent value (the mean) of a realized cluster selected via two-stage sampling. Best linear unbiased predictors developed from mixed models are widely used, but their development requires distributional assumptions or an infinite population frame...

Publication date: 2001